Overview
Brought to you by YData
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 6325 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 741.2 KiB |
| Average record size in memory | 120.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 4 |
| Boolean | 1 |
age_of_first_emp is highly overall correlated with cb_person_cred_hist_length and 2 other fields | High correlation |
cb_person_cred_hist_length is highly overall correlated with age_of_first_emp and 1 other fields | High correlation |
cb_person_default_on_file is highly overall correlated with loan_grade and 1 other fields | High correlation |
loan_amnt is highly overall correlated with loan_percent_income | High correlation |
loan_grade is highly overall correlated with cb_person_default_on_file and 1 other fields | High correlation |
loan_int_rate is highly overall correlated with cb_person_default_on_file and 1 other fields | High correlation |
loan_percent_income is highly overall correlated with loan_amnt | High correlation |
person_age is highly overall correlated with age_of_first_emp and 1 other fields | High correlation |
person_emp_length is highly overall correlated with age_of_first_emp | High correlation |
id has unique values | Unique |
person_emp_length has 837 (13.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-11 17:37:15.617006 |
|---|---|
| Analysis finished | 2025-03-11 17:37:24.961975 |
| Duration | 9.34 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
id
Real number (ℝ)
Unique 
| Distinct | 6325 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29261.091 |
| Minimum | 7 |
|---|---|
| Maximum | 58640 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 2956.4 |
| Q1 | 14932 |
| median | 29383 |
| Q3 | 43728 |
| 95-th percentile | 55345.2 |
| Maximum | 58640 |
| Range | 58633 |
| Interquartile range (IQR) | 28796 |
Descriptive statistics
| Standard deviation | 16689.773 |
|---|---|
| Coefficient of variation (CV) | 0.57037427 |
| Kurtosis | -1.1799158 |
| Mean | 29261.091 |
| Median Absolute Deviation (MAD) | 14427 |
| Skewness | -0.0005590279 |
| Sum | 1.850764 × 108 |
| Variance | 2.7854853 × 108 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 38914 | 1 | < 0.1% |
| 38854 | 1 | < 0.1% |
| 38852 | 1 | < 0.1% |
| 38847 | 1 | < 0.1% |
| 38845 | 1 | < 0.1% |
| 38839 | 1 | < 0.1% |
| 38834 | 1 | < 0.1% |
| 38804 | 1 | < 0.1% |
| 38793 | 1 | < 0.1% |
| Other values (6315) | 6315 |
| Value | Count | Frequency (%) |
| 7 | 1 | |
| 10 | 1 | |
| 15 | 1 | |
| 32 | 1 | |
| 37 | 1 | |
| 38 | 1 | |
| 62 | 1 | |
| 68 | 1 | |
| 69 | 1 | |
| 73 | 1 |
| Value | Count | Frequency (%) |
| 58640 | 1 | |
| 58621 | 1 | |
| 58611 | 1 | |
| 58563 | 1 | |
| 58549 | 1 | |
| 58546 | 1 | |
| 58532 | 1 | |
| 58515 | 1 | |
| 58498 | 1 | |
| 58492 | 1 |
person_age
Real number (ℝ)
High correlation 
| Distinct | 46 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.608854 |
| Minimum | 20 |
|---|---|
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 23 |
| median | 26 |
| Q3 | 30 |
| 95-th percentile | 40 |
| Maximum | 76 |
| Range | 56 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.112149 |
|---|---|
| Coefficient of variation (CV) | 0.22138366 |
| Kurtosis | 5.4637559 |
| Mean | 27.608854 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.912348 |
| Sum | 174626 |
| Variance | 37.358366 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 813 | |
| 22 | 769 | |
| 24 | 697 | |
| 25 | 564 | 8.9% |
| 27 | 426 | 6.7% |
| 26 | 423 | 6.7% |
| 28 | 402 | 6.4% |
| 29 | 350 | 5.5% |
| 30 | 238 | 3.8% |
| 31 | 212 | 3.4% |
| Other values (36) | 1431 |
| Value | Count | Frequency (%) |
| 20 | 4 | 0.1% |
| 21 | 194 | 3.1% |
| 22 | 769 | |
| 23 | 813 | |
| 24 | 697 | |
| 25 | 564 | |
| 26 | 423 | |
| 27 | 426 | |
| 28 | 402 | |
| 29 | 350 |
| Value | Count | Frequency (%) |
| 76 | 1 | < 0.1% |
| 73 | 2 | < 0.1% |
| 66 | 2 | < 0.1% |
| 65 | 2 | < 0.1% |
| 64 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| 60 | 2 | < 0.1% |
| 58 | 7 | |
| 57 | 2 | < 0.1% |
| 56 | 1 | < 0.1% |
person_income
Real number (ℝ)
| Distinct | 1005 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 63778.993 |
| Minimum | 4200 |
|---|---|
| Maximum | 1839784 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 4200 |
|---|---|
| 5-th percentile | 26000 |
| Q1 | 40000 |
| median | 56000 |
| Q3 | 76500 |
| 95-th percentile | 120000 |
| Maximum | 1839784 |
| Range | 1835584 |
| Interquartile range (IQR) | 36500 |
Descriptive statistics
| Standard deviation | 55045.583 |
|---|---|
| Coefficient of variation (CV) | 0.86306761 |
| Kurtosis | 411.26173 |
| Mean | 63778.993 |
| Median Absolute Deviation (MAD) | 17000 |
| Skewness | 15.658839 |
| Sum | 4.0340213 × 108 |
| Variance | 3.0300162 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40000 | 460 | 7.3% |
| 80000 | 343 | 5.4% |
| 60000 | 260 | 4.1% |
| 48000 | 153 | 2.4% |
| 50000 | 147 | 2.3% |
| 30000 | 141 | 2.2% |
| 70000 | 126 | 2.0% |
| 45000 | 115 | 1.8% |
| 120000 | 103 | 1.6% |
| 75000 | 98 | 1.5% |
| Other values (995) | 4379 |
| Value | Count | Frequency (%) |
| 4200 | 1 | < 0.1% |
| 5000 | 1 | < 0.1% |
| 9600 | 6 | |
| 10000 | 1 | < 0.1% |
| 12000 | 11 | |
| 12996 | 1 | < 0.1% |
| 13000 | 1 | < 0.1% |
| 14400 | 14 | |
| 15000 | 6 | |
| 15120 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1839784 | 1 | |
| 1824000 | 1 | |
| 1200000 | 1 | |
| 948000 | 1 | |
| 928000 | 1 | |
| 900000 | 1 | |
| 889000 | 1 | |
| 828000 | 1 | |
| 612000 | 1 | |
| 510000 | 1 |
person_home_ownership
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 98.8 KiB |
| RENT | |
|---|---|
| MORTGAGE | |
| OWN | |
| OTHER | 8 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.6072727 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | MORTGAGE |
| 3rd row | OWN |
| 4th row | RENT |
| 5th row | MORTGAGE |
Common Values
| Value | Count | Frequency (%) |
| RENT | 3320 | |
| MORTGAGE | 2631 | |
| OWN | 366 | 5.8% |
| OTHER | 8 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| rent | 3320 | |
| mortgage | 2631 | |
| own | 366 | 5.8% |
| other | 8 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 5959 | |
| E | 5959 | |
| T | 5959 | |
| G | 5262 | |
| N | 3686 | |
| O | 3005 | |
| M | 2631 | |
| A | 2631 | |
| W | 366 | 1.0% |
| H | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35466 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 5959 | |
| E | 5959 | |
| T | 5959 | |
| G | 5262 | |
| N | 3686 | |
| O | 3005 | |
| M | 2631 | |
| A | 2631 | |
| W | 366 | 1.0% |
| H | 8 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35466 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 5959 | |
| E | 5959 | |
| T | 5959 | |
| G | 5262 | |
| N | 3686 | |
| O | 3005 | |
| M | 2631 | |
| A | 2631 | |
| W | 366 | 1.0% |
| H | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35466 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 5959 | |
| E | 5959 | |
| T | 5959 | |
| G | 5262 | |
| N | 3686 | |
| O | 3005 | |
| M | 2631 | |
| A | 2631 | |
| W | 366 | 1.0% |
| H | 8 | < 0.1% |
person_emp_length
Real number (ℝ)
High correlation  Zeros 
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.7459289 |
| Minimum | 0 |
|---|---|
| Maximum | 41 |
| Zeros | 837 |
| Zeros (%) | 13.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 12 |
| Maximum | 41 |
| Range | 41 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.0339857 |
|---|---|
| Coefficient of variation (CV) | 0.84998865 |
| Kurtosis | 2.7371902 |
| Mean | 4.7459289 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.2767834 |
| Sum | 30018 |
| Variance | 16.27304 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 837 | |
| 2 | 788 | |
| 3 | 696 | |
| 1 | 581 | |
| 5 | 566 | |
| 4 | 562 | |
| 6 | 531 | |
| 7 | 454 | |
| 8 | 343 | |
| 9 | 244 | 3.9% |
| Other values (19) | 723 |
| Value | Count | Frequency (%) |
| 0 | 837 | |
| 1 | 581 | |
| 2 | 788 | |
| 3 | 696 | |
| 4 | 562 | |
| 5 | 566 | |
| 6 | 531 | |
| 7 | 454 | |
| 8 | 343 | |
| 9 | 244 | 3.9% |
| Value | Count | Frequency (%) |
| 41 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 26 | 3 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 23 | 3 | < 0.1% |
| 22 | 4 | 0.1% |
| 21 | 7 | |
| 20 | 11 | |
| 19 | 11 |
loan_intent
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 98.8 KiB |
| MEDICAL | |
|---|---|
| EDUCATION | |
| VENTURE | |
| PERSONAL | |
| DEBTCONSOLIDATION |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 9.8694071 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PERSONAL |
|---|---|
| 2nd row | VENTURE |
| 3rd row | MEDICAL |
| 4th row | MEDICAL |
| 5th row | PERSONAL |
Common Values
| Value | Count | Frequency (%) |
| MEDICAL | 1412 | |
| EDUCATION | 1240 | |
| VENTURE | 1057 | |
| PERSONAL | 1013 | |
| DEBTCONSOLIDATION | 916 | |
| HOMEIMPROVEMENT | 687 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| medical | 1412 | |
| education | 1240 | |
| venture | 1057 | |
| personal | 1013 | |
| debtconsolidation | 916 | |
| homeimprovement | 687 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 8756 | |
| O | 6375 | |
| N | 5829 | |
| I | 5171 | |
| T | 4816 | |
| A | 4581 | 7.3% |
| D | 4484 | 7.2% |
| C | 3568 | 5.7% |
| M | 3473 | 5.6% |
| L | 3341 | 5.4% |
| Other values (7) | 12030 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 62424 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 8756 | |
| O | 6375 | |
| N | 5829 | |
| I | 5171 | |
| T | 4816 | |
| A | 4581 | 7.3% |
| D | 4484 | 7.2% |
| C | 3568 | 5.7% |
| M | 3473 | 5.6% |
| L | 3341 | 5.4% |
| Other values (7) | 12030 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 62424 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 8756 | |
| O | 6375 | |
| N | 5829 | |
| I | 5171 | |
| T | 4816 | |
| A | 4581 | 7.3% |
| D | 4484 | 7.2% |
| C | 3568 | 5.7% |
| M | 3473 | 5.6% |
| L | 3341 | 5.4% |
| Other values (7) | 12030 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 62424 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 8756 | |
| O | 6375 | |
| N | 5829 | |
| I | 5171 | |
| T | 4816 | |
| A | 4581 | 7.3% |
| D | 4484 | 7.2% |
| C | 3568 | 5.7% |
| M | 3473 | 5.6% |
| L | 3341 | 5.4% |
| Other values (7) | 12030 |
loan_grade
Categorical
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 98.8 KiB |
| A | |
|---|---|
| B | |
| C | |
| D | |
| E | |
| Other values (2) | 45 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | B |
| 3rd row | A |
| 4th row | B |
| 5th row | C |
Common Values
| Value | Count | Frequency (%) |
| A | 2048 | |
| B | 2026 | |
| C | 1159 | |
| D | 837 | |
| E | 210 | 3.3% |
| F | 40 | 0.6% |
| G | 5 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 2048 | |
| b | 2026 | |
| c | 1159 | |
| d | 837 | |
| e | 210 | 3.3% |
| f | 40 | 0.6% |
| g | 5 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2048 | |
| B | 2026 | |
| C | 1159 | |
| D | 837 | |
| E | 210 | 3.3% |
| F | 40 | 0.6% |
| G | 5 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6325 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 2048 | |
| B | 2026 | |
| C | 1159 | |
| D | 837 | |
| E | 210 | 3.3% |
| F | 40 | 0.6% |
| G | 5 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6325 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 2048 | |
| B | 2026 | |
| C | 1159 | |
| D | 837 | |
| E | 210 | 3.3% |
| F | 40 | 0.6% |
| G | 5 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6325 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 2048 | |
| B | 2026 | |
| C | 1159 | |
| D | 837 | |
| E | 210 | 3.3% |
| F | 40 | 0.6% |
| G | 5 | 0.1% |
loan_amnt
Real number (ℝ)
High correlation 
| Distinct | 337 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10624.827 |
| Minimum | 1000 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 3000 |
| Q1 | 6000 |
| median | 9600 |
| Q3 | 14500 |
| 95-th percentile | 24000 |
| Maximum | 35000 |
| Range | 34000 |
| Interquartile range (IQR) | 8500 |
Descriptive statistics
| Standard deviation | 6263.5418 |
|---|---|
| Coefficient of variation (CV) | 0.5895194 |
| Kurtosis | 0.86761122 |
| Mean | 10624.827 |
| Median Absolute Deviation (MAD) | 4400 |
| Skewness | 0.96976248 |
| Sum | 67202032 |
| Variance | 39231956 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 460 | 7.3% |
| 5000 | 453 | 7.2% |
| 6000 | 360 | 5.7% |
| 15000 | 322 | 5.1% |
| 12000 | 280 | 4.4% |
| 3000 | 219 | 3.5% |
| 8000 | 206 | 3.3% |
| 20000 | 195 | 3.1% |
| 7000 | 194 | 3.1% |
| 9000 | 165 | 2.6% |
| Other values (327) | 3471 |
| Value | Count | Frequency (%) |
| 1000 | 45 | |
| 1075 | 1 | < 0.1% |
| 1150 | 1 | < 0.1% |
| 1200 | 17 | 0.3% |
| 1300 | 1 | < 0.1% |
| 1350 | 2 | < 0.1% |
| 1400 | 4 | 0.1% |
| 1450 | 2 | < 0.1% |
| 1500 | 37 | |
| 1600 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 36 | |
| 33000 | 1 | < 0.1% |
| 31000 | 1 | < 0.1% |
| 30000 | 28 | |
| 28250 | 1 | < 0.1% |
| 28000 | 12 | 0.2% |
| 27800 | 1 | < 0.1% |
| 27250 | 1 | < 0.1% |
| 27050 | 1 | < 0.1% |
| 27000 | 2 | < 0.1% |
loan_int_rate
Real number (ℝ)
High correlation 
| Distinct | 265 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.12824 |
| Minimum | 5.42 |
|---|---|
| Maximum | 23.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 5.42 |
|---|---|
| 5-th percentile | 6.03 |
| Q1 | 7.9 |
| median | 11.11 |
| Q3 | 13.49 |
| 95-th percentile | 16.35 |
| Maximum | 23.06 |
| Range | 17.64 |
| Interquartile range (IQR) | 5.59 |
Descriptive statistics
| Standard deviation | 3.2496083 |
|---|---|
| Coefficient of variation (CV) | 0.29201457 |
| Kurtosis | -0.7690113 |
| Mean | 11.12824 |
| Median Absolute Deviation (MAD) | 2.68 |
| Skewness | 0.15910106 |
| Sum | 70386.12 |
| Variance | 10.559954 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10.99 | 192 | 3.0% |
| 7.51 | 174 | 2.8% |
| 7.88 | 165 | 2.6% |
| 7.49 | 160 | 2.5% |
| 7.9 | 132 | 2.1% |
| 13.49 | 119 | 1.9% |
| 6.62 | 114 | 1.8% |
| 6.03 | 105 | 1.7% |
| 11.49 | 105 | 1.7% |
| 5.42 | 98 | 1.5% |
| Other values (255) | 4961 |
| Value | Count | Frequency (%) |
| 5.42 | 98 | |
| 5.79 | 80 | |
| 5.99 | 58 | |
| 6.03 | 105 | |
| 6.17 | 40 | 0.6% |
| 6.39 | 7 | 0.1% |
| 6.54 | 56 | |
| 6.62 | 114 | |
| 6.76 | 27 | 0.4% |
| 6.91 | 54 |
| Value | Count | Frequency (%) |
| 23.06 | 1 | |
| 21.36 | 2 | |
| 21.21 | 1 | |
| 20.89 | 2 | |
| 20.62 | 1 | |
| 20.52 | 1 | |
| 20.48 | 1 | |
| 20.25 | 1 | |
| 20.17 | 1 | |
| 20.16 | 1 |
loan_percent_income
Real number (ℝ)
High correlation 
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.18814609 |
| Minimum | 0 |
|---|---|
| Maximum | 0.56 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.05 |
| Q1 | 0.11 |
| median | 0.17 |
| Q3 | 0.26 |
| 95-th percentile | 0.39 |
| Maximum | 0.56 |
| Range | 0.56 |
| Interquartile range (IQR) | 0.15 |
Descriptive statistics
| Standard deviation | 0.1046068 |
|---|---|
| Coefficient of variation (CV) | 0.55598713 |
| Kurtosis | -0.10094991 |
| Mean | 0.18814609 |
| Median Absolute Deviation (MAD) | 0.07 |
| Skewness | 0.69345715 |
| Sum | 1190.024 |
| Variance | 0.010942583 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.13 | 763 | 12.1% |
| 0.07 | 378 | 6.0% |
| 0.17 | 258 | 4.1% |
| 0.23 | 239 | 3.8% |
| 0.15 | 209 | 3.3% |
| 0.19 | 193 | 3.1% |
| 0.09 | 193 | 3.1% |
| 0.11 | 180 | 2.8% |
| 0.14 | 179 | 2.8% |
| 0.16 | 178 | 2.8% |
| Other values (46) | 3555 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 0.01 | 28 | 0.4% |
| 0.02 | 16 | 0.3% |
| 0.03 | 120 | 1.9% |
| 0.04 | 96 | 1.5% |
| 0.05 | 121 | 1.9% |
| 0.06 | 155 | |
| 0.07 | 378 | |
| 0.08 | 160 | |
| 0.09 | 193 |
| Value | Count | Frequency (%) |
| 0.56 | 1 | < 0.1% |
| 0.53 | 2 | < 0.1% |
| 0.52 | 4 | 0.1% |
| 0.51 | 9 | 0.1% |
| 0.5 | 12 | |
| 0.49 | 10 | 0.2% |
| 0.48 | 16 | |
| 0.47 | 16 | |
| 0.46 | 16 | |
| 0.45 | 26 |
cb_person_default_on_file
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 5215 | |
| True | 1110 | 17.5% |
cb_person_cred_hist_length
Real number (ℝ)
High correlation 
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8093281 |
| Minimum | 2 |
|---|---|
| Maximum | 30 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 14 |
| Maximum | 30 |
| Range | 28 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.0644719 |
|---|---|
| Coefficient of variation (CV) | 0.69964579 |
| Kurtosis | 3.6352869 |
| Mean | 5.8093281 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.6385602 |
| Sum | 36744 |
| Variance | 16.519932 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1201 | |
| 4 | 1148 | |
| 3 | 1118 | |
| 8 | 380 | 6.0% |
| 10 | 378 | 6.0% |
| 9 | 375 | 5.9% |
| 6 | 356 | 5.6% |
| 5 | 356 | 5.6% |
| 7 | 337 | 5.3% |
| 14 | 115 | 1.8% |
| Other values (19) | 561 |
| Value | Count | Frequency (%) |
| 2 | 1201 | |
| 3 | 1118 | |
| 4 | 1148 | |
| 5 | 356 | 5.6% |
| 6 | 356 | 5.6% |
| 7 | 337 | 5.3% |
| 8 | 380 | 6.0% |
| 9 | 375 | 5.9% |
| 10 | 378 | 6.0% |
| 11 | 88 | 1.4% |
| Value | Count | Frequency (%) |
| 30 | 3 | < 0.1% |
| 29 | 5 | |
| 28 | 5 | |
| 27 | 2 | < 0.1% |
| 26 | 5 | |
| 25 | 2 | < 0.1% |
| 24 | 8 | |
| 23 | 3 | < 0.1% |
| 22 | 7 | |
| 21 | 3 | < 0.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 4777 | |
| 1 | 1548 | 24.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 4777 | |
| 1 | 1548 | 24.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4777 | |
| 1 | 1548 | 24.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6325 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4777 | |
| 1 | 1548 | 24.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6325 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4777 | |
| 1 | 1548 | 24.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6325 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4777 | |
| 1 | 1548 | 24.5% |
age_of_first_emp
Real number (ℝ)
High correlation 
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.862925 |
| Minimum | 14 |
|---|---|
| Maximum | 74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 98.8 KiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 17 |
| median | 22 |
| Q3 | 26 |
| 95-th percentile | 36 |
| Maximum | 74 |
| Range | 60 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 6.8142444 |
|---|---|
| Coefficient of variation (CV) | 0.2980478 |
| Kurtosis | 4.2137538 |
| Mean | 22.862925 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.5946833 |
| Sum | 144608 |
| Variance | 46.433927 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 1347 | |
| 22 | 472 | 7.5% |
| 21 | 441 | 7.0% |
| 23 | 439 | 6.9% |
| 20 | 376 | 5.9% |
| 24 | 354 | 5.6% |
| 19 | 332 | 5.2% |
| 25 | 318 | 5.0% |
| 18 | 284 | 4.5% |
| 26 | 238 | 3.8% |
| Other values (41) | 1724 |
| Value | Count | Frequency (%) |
| 14 | 6 | 0.1% |
| 15 | 101 | 1.6% |
| 16 | 1347 | |
| 17 | 177 | 2.8% |
| 18 | 284 | 4.5% |
| 19 | 332 | 5.2% |
| 20 | 376 | 5.9% |
| 21 | 441 | 7.0% |
| 22 | 472 | 7.5% |
| 23 | 439 | 6.9% |
| Value | Count | Frequency (%) |
| 74 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 66 | 2 | |
| 65 | 1 | < 0.1% |
| 63 | 3 | |
| 60 | 1 | < 0.1% |
| 58 | 1 | < 0.1% |
| 57 | 2 | |
| 56 | 2 | |
| 55 | 3 |
Interactions
Correlations
| age_of_first_emp | cb_person_cred_hist_length | cb_person_default_on_file | id | loan_amnt | loan_grade | loan_int_rate | loan_intent | loan_percent_income | loan_status | person_age | person_emp_length | person_home_ownership | person_income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age_of_first_emp | 1.000 | 0.556 | 0.027 | 0.013 | -0.034 | 0.045 | 0.058 | 0.059 | 0.019 | 0.059 | 0.647 | -0.621 | 0.089 | -0.064 |
| cb_person_cred_hist_length | 0.556 | 1.000 | 0.011 | 0.017 | 0.040 | 0.022 | -0.001 | 0.078 | -0.032 | 0.033 | 0.803 | 0.054 | 0.041 | 0.097 |
| cb_person_default_on_file | 0.027 | 0.011 | 1.000 | 0.024 | 0.076 | 0.621 | 0.575 | 0.028 | 0.080 | 0.211 | 0.000 | 0.056 | 0.099 | 0.020 |
| id | 0.013 | 0.017 | 0.024 | 1.000 | -0.101 | 0.037 | -0.064 | 0.034 | -0.127 | 0.136 | 0.007 | 0.003 | 0.029 | 0.042 |
| loan_amnt | -0.034 | 0.040 | 0.076 | -0.101 | 1.000 | 0.091 | 0.148 | 0.033 | 0.679 | 0.226 | 0.068 | 0.112 | 0.069 | 0.377 |
| loan_grade | 0.045 | 0.022 | 0.621 | 0.037 | 0.091 | 1.000 | 0.724 | 0.055 | 0.085 | 0.495 | 0.016 | 0.029 | 0.137 | 0.000 |
| loan_int_rate | 0.058 | -0.001 | 0.575 | -0.064 | 0.148 | 0.724 | 1.000 | 0.052 | 0.180 | 0.440 | 0.003 | -0.087 | 0.123 | -0.060 |
| loan_intent | 0.059 | 0.078 | 0.028 | 0.034 | 0.033 | 0.055 | 0.052 | 1.000 | 0.026 | 0.216 | 0.082 | 0.026 | 0.083 | 0.000 |
| loan_percent_income | 0.019 | -0.032 | 0.080 | -0.127 | 0.679 | 0.085 | 0.180 | 0.026 | 1.000 | 0.440 | -0.036 | -0.067 | 0.100 | -0.327 |
| loan_status | 0.059 | 0.033 | 0.211 | 0.136 | 0.226 | 0.495 | 0.440 | 0.216 | 0.440 | 1.000 | 0.042 | 0.099 | 0.280 | 0.031 |
| person_age | 0.647 | 0.803 | 0.000 | 0.007 | 0.068 | 0.016 | 0.003 | 0.082 | -0.036 | 0.042 | 1.000 | 0.099 | 0.039 | 0.145 |
| person_emp_length | -0.621 | 0.054 | 0.056 | 0.003 | 0.112 | 0.029 | -0.087 | 0.026 | -0.067 | 0.099 | 0.099 | 1.000 | 0.157 | 0.233 |
| person_home_ownership | 0.089 | 0.041 | 0.099 | 0.029 | 0.069 | 0.137 | 0.123 | 0.083 | 0.100 | 0.280 | 0.039 | 0.157 | 1.000 | 0.041 |
| person_income | -0.064 | 0.097 | 0.020 | 0.042 | 0.377 | 0.000 | -0.060 | 0.000 | -0.327 | 0.031 | 0.145 | 0.233 | 0.041 | 1.000 |
Missing values
Sample
| id | person_age | person_income | person_home_ownership | person_emp_length | loan_intent | loan_grade | loan_amnt | loan_int_rate | loan_percent_income | cb_person_default_on_file | cb_person_cred_hist_length | loan_status | age_of_first_emp | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7 | 7 | 21 | 20000 | RENT | 0.0 | PERSONAL | C | 2500 | 13.49 | 0.13 | Y | 3 | 0 | 21.0 |
| 10 | 10 | 30 | 78000 | MORTGAGE | 5.0 | VENTURE | B | 12800 | 10.59 | 0.17 | N | 5 | 0 | 25.0 |
| 15 | 15 | 29 | 33000 | OWN | 8.0 | MEDICAL | A | 7300 | 8.90 | 0.23 | N | 8 | 0 | 21.0 |
| 32 | 32 | 30 | 80000 | RENT | 3.0 | MEDICAL | B | 6000 | 10.75 | 0.07 | N | 8 | 0 | 27.0 |
| 37 | 37 | 22 | 68000 | MORTGAGE | 7.0 | PERSONAL | C | 15900 | 13.49 | 0.25 | N | 2 | 0 | 15.0 |
| 38 | 38 | 30 | 54000 | RENT | 0.0 | MEDICAL | B | 12500 | 11.71 | 0.24 | N | 10 | 1 | 30.0 |
| 62 | 62 | 28 | 60000 | RENT | 5.0 | PERSONAL | E | 17200 | 17.06 | 0.28 | Y | 8 | 0 | 23.0 |
| 68 | 68 | 31 | 62900 | MORTGAGE | 2.0 | MEDICAL | D | 18000 | 14.09 | 0.24 | N | 5 | 1 | 29.0 |
| 69 | 69 | 24 | 40000 | RENT | 3.0 | MEDICAL | C | 3000 | 13.22 | 0.07 | N | 4 | 0 | 21.0 |
| 73 | 73 | 24 | 24000 | RENT | 4.0 | MEDICAL | B | 3000 | 10.95 | 0.13 | N | 2 | 0 | 20.0 |
| id | person_age | person_income | person_home_ownership | person_emp_length | loan_intent | loan_grade | loan_amnt | loan_int_rate | loan_percent_income | cb_person_default_on_file | cb_person_cred_hist_length | loan_status | age_of_first_emp | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 58492 | 58492 | 21 | 85000 | RENT | 5.0 | PERSONAL | B | 10000 | 9.99 | 0.12 | N | 4 | 0 | 16.0 |
| 58498 | 58498 | 28 | 30000 | RENT | 0.0 | MEDICAL | A | 3000 | 7.49 | 0.10 | N | 8 | 0 | 28.0 |
| 58515 | 58515 | 22 | 30000 | RENT | 7.0 | EDUCATION | A | 3000 | 7.14 | 0.10 | N | 2 | 0 | 15.0 |
| 58532 | 58532 | 28 | 54000 | MORTGAGE | 5.0 | VENTURE | E | 5000 | 16.82 | 0.09 | N | 7 | 0 | 23.0 |
| 58546 | 58546 | 27 | 35000 | RENT | 3.0 | VENTURE | C | 3000 | 13.22 | 0.09 | N | 5 | 0 | 24.0 |
| 58549 | 58549 | 22 | 65000 | RENT | 3.0 | PERSONAL | A | 15000 | 6.54 | 0.23 | N | 3 | 0 | 19.0 |
| 58563 | 58563 | 28 | 54000 | RENT | 4.0 | DEBTCONSOLIDATION | E | 15000 | 16.35 | 0.28 | N | 9 | 1 | 24.0 |
| 58611 | 58611 | 21 | 24000 | RENT | 5.0 | MEDICAL | C | 10000 | 13.85 | 0.42 | N | 4 | 1 | 16.0 |
| 58621 | 58621 | 25 | 90400 | MORTGAGE | 9.0 | DEBTCONSOLIDATION | A | 9500 | 8.90 | 0.11 | N | 4 | 0 | 16.0 |
| 58640 | 58640 | 34 | 120000 | MORTGAGE | 5.0 | EDUCATION | D | 25000 | 15.95 | 0.21 | Y | 10 | 0 | 29.0 |